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Abstract. We establish large deviations properties valid for almost every sample path of a 
class of stationary mixing processes (X\, . . . , X n , . . .). These properties are inherited from 
those of S n = Y17=i Xi and describe how the local fluctuations of almost every realization 
7— I of S n deviate from the almost sure behavior. These results apply to the fluctuations of 

Brownian motion, Birkhoff averages on hyperbolic dynamics, as well as branching random 
walks. Also, they lead to new insights into the "randomness" of the digits of expansions in 
integer bases of Pi. We formulate a new conjecture, supported by numerical experiments, 
^ implying the normality of Pi. 

Q 

i — i 1. Introduction 

P< 

Q_i Given a sequence of i.i.d. real valued random variables (X n ) n >i, large deviations theory 

provides a precise estimate of the probability that the random walk S n = Ya=i deviates 
from its almost sure asymptotic behavior, as long as X possesses finite exponential moments 
on a non trivial domain. In particular, if A(A) = logE(exp(AX!)) is finite over an interval 
X>A whose interior contains then (see Cramer's theorem in [12] Ch 2.2 or Gartner-Ellis' 
^ theorem at the end of this section) 

> . 1 

cn (l.l) Vx G A'(V A ), lim lim -loguJlx - e,x + e]) = -A*(x), 

t— i £ ->o »-><» n 

where fM n is the distribution of S n /n and A*(x) = sup AeK {Ax — A(A)}. Notice that the case 
^ x = A'(0) = E,(Xi) corresponds to the almost sure asymptotic behavior of S n (u) given by 
the strong law of large numbers: S n (u)/n — > E(Xi) as n — > oo, and A*(E(Xi)) = 0. 

O 

i In this paper, we show that this large deviation principle (LDP) is transfered to almost 

;> every path of the random walk, though the behavior of Sn(lu) /N is prescribed by the strong 
law of large numbers. To see this, we look at the deviations from this behavior over the 
blocks (X(j_ 1 ) n+1 (co'), • • • , Xj n (u)) of length n <^ N picked up in (Xi(u))i<i<N- Specifically, 
we define 

AS n (j,u) = S jn (u) - S(j-iy n (uj) 

and for N = k(n) ■ n with k(n) — > oo as n — > oo, we seek a LDP providing the almost sure 
asymptotic behavior of #{l < j < k(n) : AS n (j,u) G [n(x — e) ) n{x + e)]} and its possible 



connection with (1.1). Such a LDP would describe the local fluctuations of S n . We shall 
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obtain the following result as a special case of a more general statement (Theorem 2.3). We 
consider the random sequence of Borel measures (jin)n>i on ^ defined as 

«/m #{l<j<fc(n):Ag n (j,o;)/neg} 
^ n {B) = — — — (for every Borel set 23 J, 

K [Tl J 



as well as their logarithmic generating functions 

A£(A) = -log/ exp(nAa;)d^(x) = -log(-^-^ ) exp(AA5 n (j,a;))) (AG 



3=1 



Theorem 1.1. Let (k(n)) n >i be a sequence of positive integers. Let Xq G T>a and denote 
A'(Ao) as xq. 

(1) If liminf > A*(xo) then there exists a neighborhood U of Xq in T>\ such that, 

with probability 1, for all X £ U 

(1-2) Jim A-(A) = A(A), 

and 

lim lim - log fJ%([x - e, x + el) = —A*(x ). 

(2) //lim sup < A*(a;o) and e is small enough, with probability 1, for n large 

enough the set {l < j < k(n) : AS n (j,u) G [n(xo — e),n(xo + e)]| is empty. 

(3) If lim = A*(x ) then, with probability 1, for all t > 1 we have 

(1.3) hm A£(tA ) = A(A ) + {t- l)X x . 

Remark 1.1. (1) The previous result will be extended to weakly dependent sequences such 
that A(A) = lim^oo - logE(exp(A5' n )) exists as n tend to 00 for each A in an open interval. 
For such sequences, one also has a strong law of large numbers so that for each Uq>1, one 
has 

1 , nA . „ , „\ 1 

lim 

k— yoo 

hence 

(1.4) Jim hm ~tog(~£exp(AAS B 0>))) =A(A). 

3=1 



lim — log! -^exp(AAS' no (j,a;))) = — logE(exp(A5' no )), 
k^oc n k ~[ J n 



1 . fl k 



In Theorem 2.5 we give, in terms of the growth of log(/c)/n, a fine measurement of how 



I log [l E*=i exp(AA5 n (j, w))J is close to A(A). 

(2) Theorem 1 1 . 1 1 cannot be obtained as a consequence of (1.4). 

Let us show how Theorem |1.1| applies to the description of the dyadic expansion of real 
numbers. For t G [0, 1] and i > 1 denote by ti the i th digit of the dyadic expansion of t (the 
dyadic points, which have two expansions, are of no influence in our study): t = J2i>i t$~ % - 
Let P p stand for the Bernoulli product of parameter p G (0,1), so that the X^t) = ti 
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are i.i.d. Bernoulli variables of parameter p under F p (P1/2 is the Lebesgue measure). By 
the strong law of large numbers, for P p -almost every t, limjv->oo YaLi U/N = p. Here, 
X? A = R, A(A) = log(l - p + pexp(X)), A'(R) = (0,1), and A*(x) = x\og(x/p) + (1 - 
x) log((l — x)/(l — p)) = iif(P x |Pp) for all x G (0,1). As a consequence of Theorem |1.1[ 1), 
Jog js> Ttt,^ 

if liminf > — min(log(p), log(l — p)), for P p -almost every t, for all x G (0, 1) 

f^o^n l0g ^k^*i 1 - j : (- t.) e [n(a;-e),n(x + e )]}) = -A*(a;). 



(jr'-l)n<i<jn 



Once one has such a result, it is very tempting to investigate whether or not it highlights 
questions related to the distribution of digits for numbers suspected to be normal in a 
given integer basis m > 2, i.e. such that for every Uq > 1, for every finite sequence 
(ei, . . . , e no ) G {0, . . . , m — l} n °, the frequency of the occurrence of (ei, . . . , e no ) in the 
m-adic expansion of t = J2i>i tiin~~ l is equal to m~ n °, i.e. 

(1.5) lim }#{1 <i<k: (t h t i+no ^) = ( £l • • • e no )} = m~ n \ 

Indeed, for such numbers like the fractional part of Pi, numerical experiments support the 



conjecture that (1.5) holds, showing that these numbers share statistical properties with 
almost every realization of a sequence X of independent random variables uniformly dis- 
tributed in {0, ... ,m — 1}, and in this sense are "random". The recent discovery of the 
so-called BBP algorithm [2], to compute the n th digit without computing the preceding 
digits, has opened new perspectives on this question |3J . Theorem leads to strengthen 
the conjecture about the "randomness" of Pi: the sequence of digits of Pi in a given integer 
basis obeys the same large deviations properties as almost every realization of X (see con- 



jecture 4.1 for a precise statement). This conjecture, which implies the normality of Pi, is 
supported by numerical experiments presented in Section |4j 

We will obtain extensions of Theorem |l . 1 1 valid for a class of M d - valued stationary mixing 
processes. We will also obtain a general result concerning the transfer of LDPs valid for 
random walks taking values in a separable normed vector space to LDPs valid for the lo- 
cal fluctuations of almost every realization of such random walks. These results are stated 
in Section [2] and illustrated with several natural examples in Section |3j namely Brownian 
motion, Birkhoff sums on symbolic spaces and some of their geometric realizations, branch- 
ing random walks on Galton- Watson trees, and Poissonian random walks on Poisson point 
processes. The proofs of the main results are given in Sections [5] and [6] 

We end this section by recalling general facts about large deviations theory. 

General facts about large deviations theory. Let y be a topological space and By 
stand for the completed Borel a-field. Let (fi n )n>i be a sequence of probability measures 
on (y,By). Let / : y — > [0, 00] be a lower semi-continuous function. The domain of / is 
defined as T>i = {x : I(x) < 00}. 

One says (see [12] Ch. 1.2) that (jU n )n>i satisfies in y the LDP with rate function I if for 
all set r G By. 

(1.6) — inf I(x) < liminf — log/i n (r) < lim sup — log fi n (T) < — inf I(x). 
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The function / is said to be a good rate function if, moreover, for any a G R + the level set 
{x £ y : I(x) < a} is compact. 

One says that (fi n )n>i satisfies in y the weak LDP with rate function / if the upper bound 
in (1.6) holds when T is a compact subset of y. 

The sequence (/i„) n >i is said to be exponentially tight if for every a < oo there exists 
a compact set K a C y such that lim sup^^ - logfi n (K^) < —a. In this case, if (// n )„>i 
satisfies the weak LDP with rate function / then it satisfies the LDP with good rate function 
/ (see [12] Lemma 1.2.18). 

Large deviation principles have been derived successfully for various stochastic processes 
(see, e.g. [T51 H3J H5J H2]) as well as for dynamical systems (see, e.g. |33[ EH US])- 

The Gartner-Ellis theorem. It is sometimes possible to derive, or relate, such a principle 
with the logarithmic generating functions of the measures fi n whenever y is a topological 
vector space. In this paper, when we use such a connection, we take y = R d (d > 1). 
Then, the main tool is the Gartner-Ellis theorem whose statement requires the following 
assumptions and definitions (see p2] Ch. 2.3, and [12] Ch. 4.5.3 for a version in topological 
vector spaces). Let (•, •) stand for the canonical scalar product on R d . Let {[i n ) n >i be a 
sequence of probability measures on (IR d , B R d). For each n > 1 let 

A n (A) = - log / exp(n(A, x)) d/x n (x) (A G R d ). 

n JR d 

Assume (A): For each A G R d , A(A) = lim^^ A n (A) exists as an extended real number. 
Further, the origin belongs to the interior of 2\ = {A G lR d : A(A) < oo}. 

The Fenchel-Legendre transform of A is defined as 

A*(x) = sup{(A, x) - A(A) : A G R d } (x G R d ), 

and one sets V A * = {x G R d : A*(x) < oo}. 

Definition 1.2. y G R d is an exposed point of A* if for some A G R d and all x ^ y G R d , 

(X,y) — A*(y) > (A,x) — A*(x). Such a A is called an exposing hyperplane. 

Definition 1.3. Let L : R d -+ R U {oo} and V L = {A G R d : L(X) < oo}. The function L 
is said essentially smooth if: 

(a) T>l is non-empty. 

(b) L is differentiable throughout T>l. 

(c) L is steep, namely, lim^oo |VL(A n )| = oo whenever (A n ) n >i is a sequence in T>l con- 
verging to a boundary point of T>l- 

Remark 1.2. Corollary 25.1.2 of [36] ensures that the exposed points of A* are precisely 
those x of the form VA(A) for some A G T>\. Moreover, Theorem 25.5 of [36] ensures that 
A is differentiable almost everywhere in A G T)\, and if it is differentiable everywhere in 
A G T)\, then it is C 1 . 

Theorem 1.4. (Gartner-Ellis) Under the above assumption (A): 
(1) For any closed set F C R d , 

limsup — logyU„(F) < — inf A*(x). 



LARGE DEVIATIONS FOR THE LOCAL FLUCTUATIONS OF RANDOM WALKS 5 

(2) For any open set G C M. d , 

lim inf — log u„ (G) > — inf A*(x), 

where J 7 is the set of exposed points of A* whose exposing hyperplane belongs to T>a- 

(3) If A is essentially smooth and lower semi- continuous, then the LDP with good rate 
function A* holds in IR d for (fi n ) n >i- 

A local version of this result is the following. It can be deduced from the proof of Gartner- 
Ellis' theorem. Throughout, B(X,r) stands for the closed ball of center A and radius r. 

Theorem 1.5. Suppose that A = lim^ooAn exists and is finite over an open set T>. At 
any point XofDat which A is differentiate, one has 

lim hm ilog/.„(5(VA(A),6)) = -A*(VA(A)). 



Remark 1.3. In Theorem L5 we do not require that E T>. This is because one goes 
back to this assumption by a standard reduction, systematically used in the proof of the 
lower bound part of Gartner-Ellis' theorem, as follows. Fix any Ao G T> and replace fi n 
by the measure dp, n (x) = exp(n(A , x) — nA n (A ))d/i n (x). Then replace A„(A) by A n (A) = 
ilog/ Rd exp(n(A, x))dp, n (x) = A n (A + A ) - A n (A ). 

2. Large deviations principle for local fluctuations of random walks 
We need a few notation and definitions related to the notion of weak dependence. 

Let (O, A, P) be a probability space. Given two sub-cr-algebras U and V of A, their 
a- mixing coefficient is defined as (see [33] for a detailed account): 

(2.1) a(U,V) = sup{|P(E/)P(y) -¥(Ur\V)\ : U G U, V G V} . 

Let (E,T) be a measurable space. We consider X = (Xi)i & ^ + , a stationary process 
defined on (Q, A, P) and taking values in E. 
For each n > 1, we define 

(2.2) X^ = (X- n ^)i>i = • • • ,X in ))i>i. 
Let T denote the shift operation on E N +: 

T(x 1 ,x 2 ,...) = {x 2 , •••)• 
We will assume that X satisfies some mixing properties. 

The Rosenblatt |37J mixing coefficients (ax, m )m>o of the sequence (Xj) ie N + are defined 
as: 



(2.3) «x,o = 1/2 and ax, m = sup{a(a(Xfc : k < i), cr(Xj : j > i + m)) : i > 1} for m > 1. 
Then, for u G [0, 1], one defines 

a x l (u) = inf{m : ax, m < u}. 

For each n > 1 we let S n $>(X) stand for a measurable function of (X,TX, . . . ,T n ~ 1 X) 
taking values in a normed vector space (y, \\ ||) endowed with the completed Borel a-field 
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By. A typical example will be the Birkhoff sums Y^j=o ^(T^X) associated with a measurable 
function $ : £ N + ->■ y. 

For each n > 1, denote by /i„ the distribution of the random variable S n &(X)/n (viewed 
under P). 

If y — M. d , we define the sequence of logarithmic moment generating functions 

(2.4) A n (A) = -logEexp((A,S n $(X))) = -log/ exp(n(A, x)) d^x) (A G M. d ). 
n n jR d 

Our results will use assumptions among the following. They are divided into three types. 

(1) Large deviations properties. 

(Al) The sequence (fi n ) n >i satisfies in y the LDP with rate function denoted by I. 
(Al') y = R d , and there exists a convex open set V in R d such that 

A(A) = lim A n (A) 

n^oo 

exists and is finite for each A G V. 

(Al") y = R. d , and for each A G R. d , A(A) = lim^oo A n (A) exists as an extended real 
number, and the origin belongs to the interior of T?a = {A : A(A) < oo}. 

(2) Mixing properties. 

(A2) M h = Jo(a x 1 (u)) h du<oo for all h > 0. 

(A2') There exists 7 > and 9 > such that a x , m = 0(exp(— 7m 9 )). 

(3) Approximation properties. 

(A3) There exists a sequence (5 , n $ n ) n >i of functions from E n + to y such that each S n $ n 
depends only on the 2n first coordinates and 

5 n = sup \\S n $(z) - S n $ n (z)\\/n = o(l) as n ->■ 00. 

Condition (A3) holds in particular if S n ${X) is given by the Birkhoff sums of a function 
$ defined on E N+ and there exists a sequence of functions (3> n )n>i defined on depending 
on the n first coordinates only, such that sup^ gB N + ||$(^) — $ n (z)|| = o(l) as n tends to 00. 
If there exists an integer p > 1 such that $ depends on the p first coordinates only, then 
one can take $ n = $ for n large enough, and then 8 n = 0. 

Now, we introduce the family of (random) probability measures for which we obtain large 
deviations results. 

We fix an increasing sequence of positive integers (k(n)) n >i, and for every u G Q and 
n > 1, we define 

, k(n) 

^=y-,i: with x„» = ^(TW-^Co;))/^ 

In other words, for each Borel set B C lR d , we have 

#{1 < J < fc(n) : ^^(T^- 1 )^^))^ G B) 
= k{n) • 
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If y — R , we also define 



n 



Now we start with results on the direct transfer of the LDP for S n Q(X) to the LDP for 
the local fluctuations of almost every realization of S n §(X). 

Theorem 2.1. Assume (Al-3). 

Jog J^(ff\ 

(1) Let x G T>j. If liminf > I(x) then, with probability 1, 

n— >oo ji 

lim liminf — log fi^(B(x, e)) = lim limsup — log ^(B(x, e)) = —I(x). 

e-s>0+ n^oo n 6-5-0+ n->oo H 

//limsup < I(x) then there exists e > such that, with probability 1, for n 

n— >oo Tl 

large enough the set [l < j < k{n) : S , n $(T^~ 1 ^ n X((X'))/?2 G -B(x,e)| is empty. 

(2) Let x ey\ V>i. With probability 1, 

lim limsup — log e)) = —I{x) = — oo. 

(3) If (fJ> n )n>i is exponentially tight, then so is (/i^) n >i almost surely. 

Theorem 2.2. Assume (y, || ||) is separable, as well as (Al-3). Suppose that sup x( z Vl I{x) < 

oo and liminf > sup I(x), or that lim = oo. With probability 1, (u^) n >i 

satisfies in y the weak LDP with rate function I. If, moreover, (/i n ) n >i is exponentially 
tight, then (yU^) n >i satisfies in y the LDP with good rate function I . 

Next we give results concerning the transfer of convergence properties for A n to conver- 



gence properties for A^. It is worth mentioning that under the assumptions of Theorem 2.2 



if one has additional information like ||S' n $|| 00 = 0(n), then Varadhan's integral lemma 
(see [12j Th. 4.3.1) together with Theorem 2.2 directly provides the almost sure pointwise 
convergence of A^ to A as n — > oo. 

Theorem 2.3. Assume (Al') and (A2-3). Let X G T> at which A is differentiable and 
denote VA(Ao) as x - 

(1) If liminf > A*(x ) then there exists r > such that B(\ ,r) C T> and, with 

n— >oo Tl 

probability 1, A^ converges uniformly to A over B(\o,r). 

(2) //liminf > A*(x ) then, with probability 1, 

(2.5) lim lim - \og^(B(x , e)) = -A*(ar ). 

//limsup < A*(xo), there exists e > such that, with probability 1, for n 

n— >oo n 

large enough the set {l < j < k(n) : S n <&{T^~ l ^ n X{uj)) / n G B(xo,e)} is empty. 
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(3) If lim = A*(xo) and t > t— > A(tAo) is strictly convex at 1 then, with 

probability 1, for all t > 1 we have 
(2.6) lim A^(tAo) = A(A ) + (t - l)(A ,x ). 



Remark 2.1. (1) The almost sure large deviations equality (2.5 ) provided by Theorem 2.3 2) 



is a direct consequence of Theorem |2.3[ 1) and Theorem 1.5 

(2) In Theorem |1.1 1 (3), since we consider a sequence of i.i.d. real valued random variables, 
if t > t- >■ A(tAo) is not strictly convex at 1 this means that X is constant, and the result 
obviously still holds. 



The following result is a direct consequence of Theorem 2.3 1) and 1.4 
Corollary 2.1. Assume (Al") and (A2-3). 

Suppose that sup xeT>A A*(x) < oo and liminf > sup A*(x), or lim 



oo. With probability 1, A^ converges uniformly to A on the compact subsets ofD^, hence 
the assertion of parts (1) and (2) of Theorem l.Jf. hold for (/i^) n >i- If, moreover, A is 



essentially smooth and lower semi- continuous, the assertion of part (3) of Theorem 1.4 
holds for (fJ>n)n>i! i- e - (A*n)n>l satisfies in M. d the LDP with good rate function I = A*. 

Next we want to measure more finely how big must be k(n) for A^(A) to converge to A(A) 
when A is smooth. 

If y = IR d , for any n > 1 and any subset B of T>\ let 

S n A{B) = sup{|A(A) — A„(A)| : A G B}, 

5 n $(S) = (sup||A||)||S' n $-S' n $ n || 00 /n, 
Aes 

8 n {A,$)(B) = 5 n A(B) + 5 n ^(B), 
and if A is twice continuously differentiable, let 

A*(B) = sup{A*(VA(A)) : A G B}, 
6(B) = sup{|| VA(A)|| : A G B}, 

6(5) = sup U *AD 2 A(A)A : A G B 

i{B) = A*{B)+£ 2 (B), 

where D 2 A(A) stands for the Hessian matrix of A at A. 

If B C R d and p E R* + we define B p as {A G M. d : d(X, B) < p}, where d stands for the 
Euclidean distance. 

Theorem 2.4. Suppose that (Al'), (A2') and (A3) hold, and A is twice continuously 
differentiable over T>. Let B be a compact subset of T> and let p > such that B p C T>. 
Suppose that there exists a positive sequence (e n ) n >i converging to such that 

(2.7) E ex p(- v^[MM^))-^*(£)]k (d+3 ^ < oo. 

n>l 

Let f] > 0. With probability 1, for n large enough, 

(2.8) max|A-(A)-A(A)|<^(n,r / ), 
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where £(n,r)) = (rj + 2£ 1 (B))e n + 6 n {A,$)(B p ), or £(n,r)) = £{n) = e n (l + e n )/n + 
5 n (A, <&)(B p ) if B consists of only one point. 



Remark 2.2. (1) It follows easily from the proof of Theorem 2.4 (see ( 5.10| )) that if B = {A} 



then in (2.7) one can replace (d + 3/2) by 3/2 to get the same conclusions as in Theorem 2.4 



(2) If the Xi are i.i.d, a simple modification of the proof using Lemma 5.1 2) rather than 
Lemma 5.1[ 1) makes it possible to replace (d + 3/2) by (d + 1) in (2.7). 



(3) In the context described in Section 3.2, where X takes values in a symbolic space, 

^-valued potential $ and the law of 



and 5 n Q(B p ) are 



S n &(X) represents the Birkhoff sum of a continuous 

X is a Gibbs measure, we will give conditions under which both 5 n A(B p ) 
0(l/n). Then, a choice like e n = , ylog(n)/n and log(k(n))/n — A*(B) > Wj' \og{n)/n with 

()(<„) 



^yy' > d + 5/2 + 37£(I?p) ensures that (2.7) holds and £(n,rf) 



0(log(n)/n) 



Remark 2.3. As a first explicit example of situation to which Theorems |2.3| and 2.4 can 
be applied, let us consider products of random invertible matrices applied to a normalized 
vector. Let \x be a probability measure on GL m (M). Suppose that the support of [i generates 
a strongly irreducible and contracting semi-group (see Ch. Ill in [9] for the definition). 
Suppose also that exp(r max(log + ||x||, log + ||x _1 ||)) is /x-integrable for some r > 0. Let 
X = (Xi)i>\ be a sequence of independent random matrices distributed according to fi. Fix 
a unit vector x and set S n <&(X) = log \\X n ■ ■ • X\ • There exists (see Ch V.6 in [9]) a 
neiborhood T> of 0, independent of x, such that the limit A of A n exists and is analytic on 
T> (the derivative of A at is the upper Lyapounov exponent associated with 

In the case where the Xi take values in M. d and are i.i.d, we also have the following 
improvement of Theorem |2.4 

Theorem 2.5. Suppose that the Xi are i.i.d and take values in M. d . Suppose also that 
S n <&(X) = J2k=iXi, an d A(A) = logEexp((A, Xi)) is finite over a convex open subset of 
R d . 

Let B be a compact subset ofT> and let p > such that B p C T>. Suppose that there exists 
a positive sequence (e n ) n >i converging to such that 

]T exp ( - v^[log(A;(n)) - nA*(B)])e- {d+1) exp (6(^)e n n) < oo. 

n>l 



The same properties as in Theorem 2.4 hold, with £(n,r]) = (rj + 2£i(B))e n , or £(n,r}) 
£{n) = e n (l + t n )/n if B consists of only one point. 



Remark 2.4. If B = {A}, in Theorem 2.5 we can take e n = 7log(n)/n and \og(k(n))/n 

n)/n i 

7log(n)/n 2 + 7 2 \og(n) 2 /n 3 . 



A*(VA(A)) > v9 \og(n)/n with > d+2+ 1 -\ *AD 2 A(A)A. Then£{n) < (l + e n )e n /n 



3. Examples 

This section describes various contexts to which our results can be applied. We investi- 
gate applications to Brownian motion (Section |3.1 ), dynamical systems and number theory 
(Sections 3.2 and 3.3), branching random walks (Section 3.4) and Poissonian random walks 
(Section 3.5). 
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3.1. Fluctuations of the increments of Brownian motion. Let (W t )te[o,i] be a d- 

dimensional standard Browian motion defined on a probability space (Q, A, P). Let (k(n)) n >i 
be a sequence of positive integers. For each n > 1 and 1 < j < n we denote [(j — l)/n,j/n] 
by J n j and the increment of W over the interval J n j is then denoted by AW(J n j). 
For every u G f2 and n > 1, define 

^™ = E 4nj(«)> with ^n,i(^) = (k(n)/n) 1/2 AW(J k{n)J ). 
K \ n ) j=i 

In other words, for each Borel set B C IR d , we have 



# 1 < J < k(n) : (A;H/n) 1 / 2 Aiy(J fc(n)j ) G B 



The following result is essentially a refinement of Theorem |2.3| applied to a sequence of 
independent centered Gaussian vectors with covariance matrix the identity. We will give a 



short proof in Section 6.1 



Theorem 3.1. Let R > 0. Suppose that there exists a positive sequence (e n ) n >i converging 
to such that 

(3.1) E ex P ( " v^M*W) - n{\ + ^)R 2 /2)])e; i 1 < oo. 

n>l 



With probability 1, for every Borel subset V of -8(0, R), (1.6) holds for (//^) n >i, with rate 
function I(x) = \\x\\ 2 /2. 

The choice e n = jlog(n)/n and \og(k(n))/n — (1 + A /e^)i? 2 /2 > yj' \og{n)/n with VtY > 
2 yields gl| . 

We also have a functional result based on the LDP established by Schilder (see |12j Th. 
5.2.3): for n > 1, let u n stand for the distribution of W/y/n as a random element of Co ([0, 1]), 
the space of Revalued continuous functions <fi over [0, 1] such that 0(0) = 0. Then (v n ) n >i 
is exponentially tight and satisfies in Co([0, 1]) the LDP with good rate function 

1 f l <P'(t) 2 dt if^eH 1 

otherwise 



where H 1 stands for Sobolev space of absolutely continuous elements of Co([0, 1]) with square 
integrable derivative. 

It follows from Shilder's theorem that if X = (Xi)i>i is a sequence of independent standard 
Brownian motions and S n = X\+- ■ -+X n , the distributions of the variables S n /n also satisfy 
in C*o([0, 1]) the LDP with rate /. Consequently we get almost surely the LDP with rate 
/ for the local fluctuations of S n in the sense of Theorem 2.2 This essentially yields the 
following result. 

For each n > 1 and 1 < j ' < k(n) denote by Wk( n ),j the standard Brownian motion 
t G [0,1] H> k(n) l l 2 {W((t+(j -l))/k(n))-W{{j -l)/fc(n))). For every u G Q and n > 1, 
define 



= T7ZS £ ^fe(n)jH' With X Kn),M 



k(n),j 



w, 
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Theorem 3.2. Suppose that lim = oo. With probability 1, {fJ%)n>i satisfies in 

C*o([0, 1]) the LDP with good rate junction I. 

Remark 3.1. It is possible to combine the ideas developed in this paper with those of [32J 



to obtain results in the spirit of Theorem 3Ji for some Levy processes with jumps. 



3.2. Local fluctuations of Birkhoff sums and products of matrices with respect to 
Gibbs measures. Let S m stand for the one sided symbolic space over a finite alphabet of 
cardinality m > 2: S m = {0, . . . , m — 1} N+ . The set E m is endowed with the shift operation 
T({t„}^L 1 ) = {tn+i}^^. Let A be a m x m matrix with all entries equal to and 1 and 
such that A p is positive for some p > 1. Then let (E^T) be the associated topologically 
mixing subshift of finite type of (E m ,T), i.e. = {t G S TO : V n > 1, A tn ^ n+1 = 1}- 

We denote by A^(S^,T) the set of invariant probability measures under T. 

Forn > 1 we define £ An = {(h...t n ) G {0,...,m-l}" : VI < k<n-l, A tk , tk+1 = 1}. 

If t G Y,a and n > 1 we denote tx ■ • ■ t n by t\ n and for w G XU.n the cylinder {t G £a : 
£|n = w} is denoted [w]. 

The set is also endowed with the standard ultra-metric distance d(t,s) = m~'* As ', 
where \t A s\ — sup{n : t\ n = s\„}. 

If if) is a continuous function from to M, the topological pressure of ip is defined as 
P[T,i/i) = sup{z/(^) + h u (T) : z/ G .M(E A ,T)}, and one has (see [TO]) 

P(T,ip) = jirn - ^ sup exp(S n i{;(y)). 

n^oo n we 2 Ain yE[w] 

We say that if) satisfies the bounded distorsion property if 

supt> n < oo, where v n = sup \S n ip(t) — S n ip(s)\ < oo. 

n>l t,s£S A 

In this case, it is well known that sup{v(ip) + h v (T) : v G Ai (XU, T)} is attained at a unique 
and ergodic measure called the equilibrium state of ip (see [TUl [38j). We will denote it by 
v^. This measure is a Gibbs measure, in the sense that there exists a constant C > such 
that 
(3.2) 

V n > 1, Vt G XU, CT 1 exp(S n ^(t) -nP{T^)) < v^{[t\ n )) < C exp{S n iP{t) - nP{T^)). 

Moreover, if $ is a continuous mapping from X^ to IR d such that each component of $ 
satisfies the bounded distorsion property, then A G IR d t— y P(T, (A, $)) is a C 1 mapping from 
R d to E (see [3H] and 0). 

3.2.1. Results for Birkhoff sums. We fix a real valued potential ip on X^ satisfying the 
bounded distorsion property. Then, the process X defined as the identity map of X^ is 
stationary with respect to the ergodic measure v^. We also fix $, a continuous mapping 
from Tja to M. d and define (S n §(X)) n >i as the sequence of Birkhoff sums of $. 

Thus, setting (fi, P) = (XU, u^), the quantities introduced in Section [2] take the following 
form. For all n > 1, B G B R d and A G R d , 

H n (B) = u^({t G Y>a : S n $(t)/n G £} 
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and 



An (A) = 1 logEexp((A,S n $pO)) = - / exp((A, S n $(t)» dz^(t). 



Also, /i^ and A^ are denoted /i^ and A^ respectively and we have for t G E^, n > 1, and 
A G 

t , m #{1 < j < k(n) : S n <S>(T^ n t)/n G fi} 
= kin) ' 

and 

A* (A) = - log / exp(n(A,ar))d/i*(z). 



Due to the Gibbs properties of (3.2), A(A) = lim^oo A„(A) exists and takes the form 

A (A) = P(T,ip + (A, <&)) - P(T,ip). 

If, moreover, each component of $ satisfies the bounded distorsion property then A is C 1 . 
Thus, condition (Al") (hence (Al')) hold with V A = R d . Moreover, 6 n A(B) = 0(l/n) for 
bounded sets B. 

For (A2) to hold we must ask some mixing properties of v^. It is quite simple to see 
that (A2) holds under the stronger assumption that there exists 7 > and 9 > 1 such that 
&x,m = 0(exp(— 7 log(m) 61 )). Then, due to Theorem 1.11 in [4J, (A2) holds as soon as the 
modulus of continuity of ip, namely K,(ip, ■) satisfies tz(ip, 5) = 0(exp ( — 7 (log | log(5)|)| e ')j 
as 5 for some 7 > and 9 > 1. Also (A2') holds as soon as k(^/j,5) = 0(exp ( — 
7I log(^)! 61 )) as 5 — > for some 7 > and 9 > 0. 

The function $ being continuous on the compact set (E^, d), (A3) always holds since 
we can always approximate $ by a function $ n depending only on (ti,...,t n ) so that 
\\S n <S>-S n $ n \\ 00 <j: n k=1 K(<5>,m- k ) = o(n). 



Thus under the above conditions on $ and ip assuring (Al") and (A2) Theorem 2.3 



and Corollary |2.1 can be applied to this context and provide information regarding the 
convergence of A l n to A for i/^-almost every t. If, moreover, we assume that ip and the 
components of $ are Holder continuous, then A is analytic (see for instance Th. 5 in [38]) 
and (A2') holds, so that we can apply Theorem 2.4[ 

In fact, even if $ is only supposed continuous, (/i„)„>i satisfies in M. d the LDP with good 
rate function 
(3.3) 

'inf {P(T, V) - {K{T) + v{$)) : v G M(E A , T), z/($) = x] = A*{x) HxeVj 
00 otherwise 



Kx) 



where T>i = {^($) : v G .M (E^, T)}, and / is bounded over the compact convex set Dj. 
This LDP essentially follows from Theorem 6 of [46] (which deals with Holder potentials), 
and the duality between the pressure and entropy functions (see [T71 fT8l 14*41 122] for details 
and related works). Thus (Al) holds. It follows that we can apply Theorem 2.2 and transfer 
the previous LDP to the local fluctuations of S n <&: 

Theorem 3.3. /fliminf > sup A*(x), then for v^- almost every t, the sequence 

n ^°° n xeV! 

(/4J„>i satisfies in M. d the LDP with good rate function given by (3.3). 
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Thus, we can also deal with the cases where the function A is non differentiable at some 
A G M. d because (A, $} + ip have at least two equilibrium states with distinct entropies (see 
[38J p. 52 for instance). 

Some geometric applications. The previous results have applications to geometric realiza- 
tions of (Ea,T), for instance on repellers of topologically mixing C 1+e conformal maps of 
Riemannian manifolds. For such a repeller (J,f), they make it possible to describe the 
local fluctuations of log ||-D/|| almost everywhere with respect to any enough mixing 
Gibbs measure on (J, /); this means that while with respect to such a measure one ob- 
serves on almost every orbit an expansion ruled by a fixed Lyapounov exponent equal to 
^(log ||£)/(x)||), we can finely quantify local fluctuations with respect to this global prop- 
erty. The same can be done along the stable and unstable manifolds on locally maximal 
invariant sets of topologically mixing Axiom A diffeomorphisms (see |10] 123] for details on 
these dynamical systems). 

Another application concerns the harmonic measure on planar Cantor repellers of C 1+e 
conformal maps /; recall that given such a repeller J, this measure is the probability mea- 
sure \x such that for each t G J and r > 0, /j,(B(t,r)) is the probability that a planar 
Brownian motion started at oo attains J for the first time at a point of B(t,r). It turns 
out that fi is equivalent to the equilibrium state n v of a Holder potential ip on J (see jllj 
or |29j). Given another enough mixing Gibbs measure the ergodic theorem ensures that 
lim r ^ +log/v(5(£,r))/log(r) = (P(ip) - u^(ip)) / u^log \\Df\\) for ^-almost every t. Then, 
our result yields information on the fluctuations with respect to this behavior. Indeed, one 
can use the coding of (J, /) by a subshift of finite type thanks to a Markov partition and 
apply our results to the pair (S n (ip — P(ip)), S n \og \\Df\\). If we remember the origin of ip, 
this yields information on the local fluctuations of the Brownian motion around z^-almost 
every t. This can be made more explicit in the case that J is self-similar and homogeneous, 
for instance when J = K 2 with K the middle third Cantor set. There our results provide, 
for i/^-almost every t, information on the distributions of the values 

\og^{C jn {t)) _ x + S n ( V - P(ip))(f^(t)) + cW l < • < fc(n) 



log /v(C (j ~ 1)n (t)) jnu^ip - P{ip)) j 

where Cfc(t) is the triadic cube of generation k containing t and lim^oo e(k) = 0. 

The previous interpretations of our results about the local behavior of Gibbs measures 
can be extended to the case of Axiom A diffeomorphisms invoked above. 

Thus, to summarize, while [331 EU H6] provide large deviations with respect to the almost 
sure asymptotic behavior of Birkhoff sums on a hyperbolic invariant set endowed with a 
Gibbs measure, our results provide a natural complement by describing the fluctuations 
with respect to this behavior on almost every orbit viewed by this measure. 

The next two subsections briefly discuss extensions to norms of Birkhoff products of 
matrices of the previous properties of Birkhoff sums of potentials. 

3.2.2. Birkhoff products of positive matrices. Suppose that M is a mapping from 
to the set of positive square matrices of order d > 1, and fix an enough mixing Gibbs measure 
If the components My a re so that log(Mjj) has the bounded distorsion property then, 



one can apply Theorem 2.3 to S n $(X(t)) = S n $(t) = log \\M{t)M{T{t)) ■ ■ ■ M(T Tl 



with respect to v^. Indeed, the convergence of A n (A) for A G R comes from the Gibbs 
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property of and the subadditivity and superadditivity properties of S n Q(X), and the 
differentiability of A comes from the variational principle for subadditive potentials (see [20J 
for instance). If the components of M are only supposed continuous A n (-) still converges 
but may be non differentiable. It is then possible to extend the result explained in the 
previous subsection and show that (Al) holds for S" n $ with the good rate function / still 



satisfying (3.3) and bounded over Vj. The only difference is that here f($) is defined as 
lini n _ >oo n Je a Sn§{t) dt. 

3.2.3. Bernoulli products of invertible matrices. Suppose that we are given Mi, . . . , M m , 
m matrices of GL(d, C) such that there is no proper non-zero linear subspace V of C d such 
that Mi(V) C V. Then, define M(t) = M h and S n $(X(t)) = S n $(t) = log ||M tl ■ ■ ■ M t J 
for t G S m . Nice superadditivity and subadditivity properties (see |21| ) make it possible to 
extend the results of the previous section to this context. We do not enter into the details. 

3.3. Local fluctuations in the continued fraction expansion of Lebesgue-almost 
every point. The interval [0, 1) is endowed with the dynamics of the Gauss transformation 
/(0) = 0, f(t) = l/t — [l/t\ if t G (0,1). Then, the continued fraction expansion of an 
irrational number t G (0, 1) is represented by the sequence [cii(i); a^if)', ■ ■ ■ ; a n (t); • • • ], where 
ai(t) = [l/t\ and a n (t) = ai(/™ _1 (t)) = [^/ f n ^ 1 (t)\. The Gauss measure \i G whose density 
with respect to the Lebesgue measure on [0, 1) is 1/(1 + t) log(2) is ergodic with respect to 
/, and it possesses the strong mixing properties required in (A2) (see [8] for instance). Now 
let $(t) = logdi(t) for t G (0,1). An application of the Birkhoff ergodic theorem proves 
that for Lebesgue almost every £, one has S n $(£) = J2k=o l°g a fcW ~ n Jq 1 logai(t) dno(t)- 

Here we are concerned with the limit of A n (A) = rT x log Jq 1 exp(AS , n $(t)) dfio(t) whenever 
it exists. For each n > 1 and each sequence ai, . . . a n of integers let us denote by I ai , » ,a n the 
interval {t G [0, 1) : [oi(i); o,2(t)] ■ ■ ■', a n (t)] = [a\; a2\ ... ; a n }}. It is clear that the question 
reduces to studying n^ 1 logJ](a 1 ,...,a„)e(N + )"(c t i • • • a, n ) x \I au ... , a „ |; this sequence converges for 



A < 1 to a limit A(A) analytic in A (see Section 4 of [19]). Consequently, Theorem 2.3 



Corollary |2 . 1 1 and Theorem 2. 4| provide large deviations properties for the local fluctuations 



of log(ai(i)) + • • • + log(a n (t)) almost everywhere with respect to the Lebesgue measure. 

The previous example can be generalized by studying the local fluctuations of the Birkhoff 
sums associated with good potentials on the symbolic space over an infinite alphabet with 
respect to enough mixing Gibbs measures. We refer the reader to [19J and [40J for further 
examples and references. 

3.4. Local fluctuations of branching random walks (BRW) with respect to gen- 
eralized branching measures. 

Let (N, (^i, $i), (^2, $2), • • • ) be a random vector taking values in N + x (R x R d ) N +. 
In the sequel, the distribution of will define a supercritical Galton- Watson tree, on the 
boundary of which will live a Mandelbrot measure determined by (^1, ip2, ■ ■ ■), with respect 
to which we will look almost everywhere at the local fluctuations of a branching random 
walk whose distribution is determined by ($1, $2, • • • )■ Here, (^1, ip2, ■ ■ ■ ) and ($1, $2, • • ■ ) 
play roles analogous to the potentials ip and $ in the previous section. 

Let {(N u0 , (ip u i, $ u i), (ip U 2, ®u2), ■ ■ ■ )}u be a family of independent copies of the vector 
(N, (ipi, $1), (ip 2 , $2), • ■ ■ ) indexed by the finite sequences u = U\ ■ • ■ u n , n > 0, Uj G N + 
(n = corresponds to the empty sequence denoted 0), and let T be the Galton- Watson 



LARGE DEVIATIONS FOR THE LOCAL FLUCTUATIONS OF RANDOM WALKS 



15 



tree with defining elements {N u }: we have G T and, if u G T and i G N + then ui, the 
concatenation of u and i, belongs to T if and only if 1 < i < N u . Similarly, for each 
u ^ Un>of^+; denote by T(u) the Galton- Watson tree rooted at u and defined by the {N uv }, 

The probability space over which these random variables are built is denoted (T,A, P), 
and the expectation with respect to P is denoted E. 

Let us define the 1R U {oo}-valued convex mapping 

N 



L: AGM d ^logE ( $>xp(^i + (A, . 



i=i 

We assume that 

E^^exp(^i)) = 1, logE(^ ^exp(^)) <0 and e(( ^ exp(^i)) log + ( ^ exp(^i))) < oo. 

i=l i=l i=l i=l 

Then, it is known (see [3"T| |2Sl that for each u G Un>o^+) the sequence 

Y n(u) = exp(lp UVl H h Ipuvi-vj 

B=Dl-»„6T(«) 

is a positive uniformly integrable martingale of expectation 1 with respect to the natural 
filtration. We denote by Y(u) its P-almost sure limit. By construction, the random variables 
so obtained are identically distributed and positive. Also, the Galton- Watson tree T is 
supercritical. 

Now, for each u G U«>o^+) we denote by [u] the cylinder u ■ N + N+ and define 

v{[u]) = l T (u) exp(Vv H h VV--«J Y (u)- 

Due to the branching property Y(u) = ^i=i exp(ip U i)Y (ui) , this yields a non-negative addi- 
tive function of the cylinders, so it can be extended into a random measure z/ 7 (7 G T) on N+ + 

endowed with the Borel cr-field B = £>(N+ + ). This measure has <9T = f|n>o U«=Mi - «„erM as 
support. 

Now, let VI = T x N+ + . We can define on (Q, A® B) the probability measure 



F(A)= [ [ l A ( 1 ,t)du 1 (t)dP(z). 



Then, it is known (see [28j for instance) that the random variables X n (7, £) = $^...^(7) are 
i.i.d. with respect to P. If, moreover, VL(0) exists then it equals E(Xi) and [X\ + • • ■ X n )/n 
tends to VL(0) P-almost surely. In terms of the BRW Ya=i &ti-u on T, this means that with 
P-probability 1, for z/ 7 -almost every t G <9T, we have lim^oo Ya=i ^h-uij) /n — VL(0). 

Moreover, in the present context, if we set X = (JQ)j>i, since the Xi are i.i.d. we have 
A(A) = logEexp((A, Xi)) = L(A). Consequently, if L is finite on an open convex subset T> 
of R d , local fluctuations of the BRW Ya=i $ti» 

. t . are described P-a lmost surely ^ 7 -almost 
everywhere thanks to Theorem 2.3, Corollary 2.1 and Theorem 2.4 When $^ G {0, 1} for 
all i > 1, this is related to percolation on the Galton- Watson tree T (see |26j). 
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3.5. Local fluctuations of Poissonian random walks and covering numbers with 
respect to compound Poisson cascades. 

As in the previous section, the probability space over which we are going to define random 
variables is denoted (T, A, P), and the expectation with respect to P is denoted E. 

Let £ > and V a Poisson point process in M x (0, 1] with intensity A given by 

£dsdA 



A(dsdA) 



A 2 



For every (s, A) G V let J(s, A) = (s, s + A). The question of knowing whether M + \ {0} is or 
not almost surely covered by the intervals J(s, A) has been raised in [30] in connexion with 
a similar problem previously raised in [13] for random arcs on the circle. These problems 
have been solved in [HI W2\ (see also [21] for further information on this question). Then, 
works [Tfij E] have been dedicated to the geometric heterogeneity of the asymptotic behavior 
of the covering numbers defined as follows (in fact, all the works mentioned above consider 
more generally the case of Poisson intensities invariant by horizontal translation). Here we 
rather look at local fluctuations of these numbers. 

For every t G [0, 1] and n > 0, the covering number of t at height e~ n by the Poissonian 
intervals J(s, A) is defined as 

iV n ( 7 ,t) = !{J(-,A)}W = #{( s > ^) e V : A > e- n , t G J(s, A)} (7 G T). 

(s,X)eV, \>e~ n 

For every t G [0, 1], this covering number can be seen as the "Poissonian" random walk 
Sn(7, t) = Xl (7, £) + ••• + X n (j, t) associated with the random variables Xi(j, t) defined as 

Xi{l,t) = E !{J(-,A)}(*) 
(s,A)eP, 

e- i <A<e - (*- 1 5 

= #{(s, A) G P : e- 1 < A < e"^ 1 ), t G J(s, A)} (i > 1). 
The choice of A ensures that the Xi(-,t) are i.i.d. We can describe the fluctuations of S , n ( 7 , t) 



thanks to Theorems 2.3 and 2.5 by considering random measures on IR + , namely compound 
Poisson cascades [6J. In fact, the invariance by horizontal translation of the constructions 
makes it possible to restrict ourselves to [0, 1] without loss of generality. 

It turns out that we can also describe a more general model of Poissonian random walks 
in the spirit of branching random walks. To to this, we consider a random vector (?/>, <3>) G 
K x ]R d , and to each (s, A) G V we associated a copy (ipux), $(s,A)) °f (Vs ^) m sucn a wa Y 
that these random variables are independent and independent of V. 

For each t G [0, 1] and G {ip, $} we consider the random variables 

Xf{i,t)= E ^M) («>1, 7eT) 

(s.A)eP, 

e- i <A<e - (*- 1 ) 

as well as the Poissonian random walk 5*^( 7 ,t) = Xf( 7 ,t) + • • • + X^j, t). An easy cal- 
culation shows that for any (g, A) 6lx IR d , for every t G M+ one has Eexp (qS%(-,t) + 

(A, *)» = exp (<E(exp(# + (A, $» - 
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We define over [0, 1] the sequence of random measures introduced in [6J as 

(3.4) i/ 7tB (df) = (Eexp(^(-,t)))" 1 exp(^( 7 ,t))dt = exp(^( 7 ,t)-<E(exp(^)-l)))dt. 

Let r(q) = (1 — £)(1 — q) + £E(exp(g-?/>) — qexp(ip)). We assume that r'(l) < 0. Then, for 
P-almost every 7, v ljn converges in the weak-star topology to a fully supported measure z/ 7 
over [0, 1], and whose total mass has expectation 1 (see [6j). We can defined on Q = T x [0, 1] 
endowed with A <8> B([0, 1]) the probability measure 

F(A)= f I l A ( 7 ,t)dz/ 7 (t)dP( 7 ). 

J~£ J [0,1] 

Let 

L : A £ R d ^^E(exp(^+ (A,$)) - l). 

The random variables Xf(j,t) are i.i.d with respect to P, and it is not difficult to s ee that 
A(A) = L(A). Thus, if L is finite on an open convex subset T> of IR d , Theorems 2.3, Corol- 
lary 2.1 and Theorem 2.5 applied to X = (Xj)j>i with respect to P provide a description of 
the local fluctuations of 5* (7, t), P-almost surely, for i/ 7 -almost every t. 

4. Conjecture on the "randomness" of fundamental constants 

As mentioned in the introduction, our results lead us to formulate a new conjecture 
regarding how in any integer basis m the digits of fundamental constants such as the number 
Pi or the Euler constant look like almost every realization of a sequence of i.i.d random 
variables uniformly distributed in {0, • • • ,m — 1}. This conjecture implies the normality 
property. 

Recall the notations of Section 3^ Consider a lR d - valued continuous potential $ defined 
on E m = {0, . . . , m — 1} N+ endowed with the shift operation denoted T. Consider a sequence 
(k(n)) n >i of positive integers. Recall that in Section 3.2.1 we have defined for t £ S m the 
sequence of Borel measures (/4J n >i an d logarithmic generating functions (Aj l ) n > 1 as 

1 H n ) 

K \ n ) j=i 

and 

(4.1) A£(A) = - log / exp(n(A,x))d/4(:r). 

Consider now the potential ip = and the associated equilibrium state u^, i.e. the measure 
of maximal entropy on (S m ,T). We have P(ijj) = log(m). 

The process X = (Xj)j>i defined on the probability space (S m , v^) as X(t) = (tj)j>i is 
a sequence of i.i.d random variables uniformly distributed in {0, ■ ■ ■ ,m — 1}, and the rate 
function / provided by (3.3) takes the form 

_ Jinf {log(m) -h v (T) : v £ M(E A ,T), z/($) = x] = A*(x) if x £ Vi 

1 [Xj < , 
[ 00 otherwise 

where £> 7 = {z/($) : v £ M{Y, A ,T)} and A(A) = P((A,$)) for A £ R d . 

Let D = (Di)i>i be a sequence of digits in the integer basis m. We say that D satisfies 
property (V) if 
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Property (V): The sequence (/i„) n >i obeys in Mr the same LDP with rate / as that 
provided by Theorem 3.3 for (/4J n >i (f° r ^-almost every t). and 

Theorem 4.1. Property (V) implies the normality of Y^i>iDim~ l in basis m. 

Proof. We prove the equivalent following fact: Property (V) implies that for all real- valued 
continuous function <f on E m , linip^oo S p if(D)/p = v^f). 

Let if be a real- valued continuous function on S m . For n > 1 let k\{n) = (2m) n . We 
have (n + l)/ci(n + 1) < Amnki(n) for all n > 1. Fix an integer n > 1, and for n > n , 
1 < i < Am - 1 and < £ < m n ° let k iti (n) = (i + £m~ n °)A;i(ri). The LDP of property (V) 
holds for every sequence (k(n)) n > no with /c(n) G : 1 < i < Am — 1, < £ < m n °}, 

since liminf^oo \og{k{n))/n > log(m). 

If p is a positive integer larger than let n p be the largest integer n such that 

nk\(n) < p. By construction, p < Amn p ki(n p ). Let (i p ,£ p ) be the unique pair in {(i,£) ■ 
1 < i < Am — 1, < i < m n °} such that n p ki p ^ p {n p ) < p < n p ki p / p (n p ) + n p ki(n p )m~ n ° . 

We have 

S p f(D) S n kt e (n p ) . _ . , . 

— = — — - + 0(771 °) (asp— >■ oo), 

P n P k ip/p (n p ) 

where the constant in 0(m~ n °) depends only on if. Consequently, since at fixed uq we deal 
with the finite number of sequences k(n) G {ki^(n) : 1 < % < Am — 1, < £ < m n °}, if 
we prove that the LDP of property (V) implies that lim^oo S n k(n)^{D) / nk{n) = v^[if) for 

each such sequence, we will get limsupp^^ | spip j D ' — u^((f)\ = 0(m~ n °). Then, letting n 
tend to oo will yield the desired conclusion. 

We reduced the problem to showing that lim^oo S n k( n )^p{D) /nk(n) = v^{f) whenever 
lim inf \og{k{n))/n > log(m). Suppose that liminf^oo \og{k{n))/n > log(m). For 
e > 0, we can write 



< I \x- Vj,fo)\dfi%(x) < e + 2\\ip\\ 00 ^({x : \x - ^(ip)\ > e}), 



nk(n) 

and due to property (V), /^({x '■ \ x ~ ^(v 9 )! > e }) tends to as n — > oo. Consequently, 
hm sup^ Snk ^ D) -M<P)\<e fOT a11 e > °- ° 

Remark 4.1. One can wonder if, conversely, the normality of Yli>i Difn~ l implies property 
(V) for D. To begin with this question, it is interesting to seek an explicit normal number in 
basis m for which property (V) holds; Champernowne's constant C m should be investigated. 

Our conjecture is the following. 

Conjecture 4.1. For every integer m > 2, the digits of the fractional part of either Pi or 
the Euler constant in basis m satisfy (V) . 



Conjecture AA_ is support ed b y numerical experiments, which focus on the validity of 
the conclusions of Theorem 2.3 for (A^) n >!. From the numerical point of view, the most 
tractable situations concern potentials that are constant over the cylinders of the first gen- 
eration. In the context of digit frequency associated to normality of numbers, it is natural 
to consider potentials of the form $ a (£) = l{a}(^i), with a G {0, ...,m — 1}. Here, we 

9 + exp(A) 

show simulation results when m = 10 and a = 0; in this case A(A) = log — and 
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A*(x) = xlog(10a;) + (1 — x) log(10(l — x)/9). We use the 160 millions first decimals of Pi 
and the Euler constant available at http : //www . numberworld . org/ constants . html| and 
http : //www . ginac . de/~kreckel/news . html] 



At first we consider a realization Xi, ■ ■ ■ ,X^ of N = 1.6 ■ 10 s independent random 
variables uniformly distributed in {0, . . . ,9}, that are viewed as the N first terms of the 
realization of an infinite sequence of such independent variables X±, ■ ■ ■ ,X n , ■ ■ ■ . In fact 
these digits are pseudo-random numbers provided by the Mersenne twister algorithm used 
in Matlab, so that actually we are also testing how such a sequence really looks like the 
theoretical one. 

At each scale n, we choose a number of intervals k(n) = exp(nA*(A'(Ao))) with Ao = 0.8, 
so that n ■ k(n) < N for n < 300. Due to the fact that $o(^) depends only on the first digit 
of t, A^ is constant over the cylinder [X\ ■ ■ ■ X n k< n )\ which contains the random sequence 
D = Xi ■ ■ ■ X n ■ ■ ■ , and we can estimate it easily. 

Let Ai and A 2 the two solutions of the equation A*(A'(A)) = A*(A'(0.8)). One has Ai ~ 
-1.45 and A 2 = A = 0.8. 

Figure [T|-(a)(left) illustrates the result of Theorem |2.3| (1) and (3): the empirical log- 
arithmic moment generating functions A^ converge to the function A over the interval 

(Ai,A2), and on (— oo, Ai] as well as on [A2,oo), A — A^ converges to A translated by an 
affine map. Figure [l}(a) (right) illustrates the same result in term of the Fenchel-Legendre 

transform (A^)*, which converges in the interval (xi,x 2 ), where x\ = A'(Ai) ~ 0.0254 and 
X2 = A'(A 2 ) — 0.1983 (the intervals of convergence are materialized by the dashed blue 
vertical lines). Moreover, on this figure one observes that the domain over which the func- 
tions (A®)* are finite, which cor responds to (A^)'(IR), converges to the interval [xi,x 2 ]. 



This is predicted by Theorem 2.3^2), since (A^)'(R) is equal to the smallest closed interval 
containing {S n ^(T n ^~ 1 ^D)/n : 1 <j < k(n)}. 

Figure [!}(&) numerically shows that, in terms of the convergence of the logarithmic mo- 
ment generating functions A^ and their Fentchel-Legendre transform, the first 160 million 
decimals of Pi behave exactly like the previous sequence X%, . . . ,Xn (though we do not 
expose the corresponding figures here, we verified that the same holds for all function $ a , 
a = 0, • • • ,9). The same conclusions hold for the 160 millions first decimals of the Euler 
constant, as shown on Figure [I]- (c). 



5. PROOFS OF THE MAIN RESULTS 

Recall that for any n > 1 and any compact subset B of T>, 5 n A(B) = sup{|A(A) — A n (A)| : 
A G B}, 6 n $(B) = (sup AeB ||A||)||£„$ - SAlU/n, A*{B) = sup{A*(VA(A)) : A E B}, 
= sup{||VA(A)|| : A G B}, &(B) = sup{| *AD 2 A(A)A : A G B}, and £(B) = 
A*{B)+t 2 {B). 

The following lemma and corollary of its first part will be precious for us. The first part 
of the lemma can be found in ([35], p. 61), and the second one in [I]. Recall that given a 
real valued random variable Y defined on (Q,A, P), its quantile function Qy is defined as 
the right-continuous inverse of the tail of Piyi, the probability distribution of \Y\, i.e. 

Q Y {u) = inf{t > : F(\Y\ > t) < u] (u> 0). 
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decimal digits of the Euler constant 

0.07 



< 



Q e 
< 



0.06 
0.05 
0.04 
0.03 
0.02 
0.01 
0. 



0.1 0.15 

X 



0.2 



— theo. 


- + 


n=20 


-A 


n=40 


-e 


n=80 


-* 


n=160 


-v 


n=300 
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FIGURE 1. Behavior of the logarithmic moment generating functions and 
their Fenchel-Legendre transform for t equal to an i.i.d sequence of random 
variables uniformly distributed in {0, ■ • • ,9} and for t equal to the the decimal 
digits of the number Pi and the Euler constant. 
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Lemma 5.1. (1) Let (Yj)j>i be a real valued and centered stationary process. For each 
p G (1, 2) and N > 1 one has 



E 



N p\ r 1 

)<C P N (ay^u^QYiufdu, 

.7=1 J J ° 



^ Let (y?)j>i &e sequence of complex i.i.d. random variables. For each p G (1,2) and 
N > 1 one /ias 



E 



E y 7 



< 2 p iVE(|yi 



Then, the fact that Jq 1 Qy(u) p du = E|y| p together with Holder's inequality yield 

Corollary 5.1. Let {Yj)j>\ be a real valued and centered stationary process. For each 
p G (1,2), e > and N > 1 



E 



N p\ / r l \e/(l+e) 

E F i J ^ ^ [ J (ay 1 (M)) (p_1)(1+e)A dwj (E|n 



(l+e)p'\ 1 /( 1 + e ) 



We start with the most technical results, namely Theorem 2.3, 2.4 and 2.5 



5.1. Proof of Theorem 2.3[ (1) For A G V and n>lwe have 
(5.1) 



I i 

A " (A) = n l0g Hn) E«P((^^*( TB °" 1) ^)»- 

Fix ro > such that _B(Ao,ro) C P. We must prove that we can find r G (0,ro) such that 
almost surely, for all A G B(X,r), A^(A) converges to A(A) as n — >■ oo. 

In order to exploit the mixing properties of the initial process (X 1; ■ ■ ■ ), we use the uniform 
approximation of S n <& by the functions S n $> n . For A G T> and n > 1 let 

1 1 K n ) 
A^"(A) = - log — - £ exp ((A, S^r*- 1 )*))) 



and 



A^(A) = -logEexp((A,5„$ n (X))). 
n 



By assumption (A3) for all A G B(\o,ro) we have 
(5.2) 

max(|A^(A)-A^A)|,|AW(A)-A n (A)|) < 5 n $(B(\ ,r Q )), with Jim 5 n $(5(A ,r )) = 0. 

Consequently, it is enough to find r G (0,ro) such that almost surely, for all A G B(Xo,r) we 
have lim^ A&*>' W (A) - A^(A) = 0. 
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For A G B(Xq, r ) we write 



A^(X) = AW(A) + -loj 



k{n) 



n 



j X:exp((A,5A(T^- 1 )X)) -nA^(A)) 



(5.3) 

where 
(5.4) 



zj n) (A) = exp ((A, 5 , ri $ n (T n(j_1) X)) - nA^(X)) - 1. 



Now we notice that (Zj n \\))j>i is stationary and centered, and each Z^ l \X) belongs to 
a(Xj n \ (recall (2.2)). Consequently, after writing 

k(n) 



e( £^ n) (A) 



< 2 P_1 E 



£ 4 n) (A) 

l<2j<A:(n) 



+ 2 P_1 E 



E 4;ii(A) 

l<2j'+l<fc(n) 



we can apply Corollary 5.1 and get for p G (1, 2) and e > 



(5.5) 
where 



E 



k(n) 



£zj n) (A) )<M(n,p,e)k(n)(E\Z[ n \X)\ 



(n) m I (l+e)p\ !/(!+£) 



M(n,p,e) = 2^ ( / (a~\ n) (u))^ 1+ ^ du 



From now on we fix p G (1,2) close enough to one and eo > small enough so that for all 
A G B = B(X ,r /2) and e G (0,e ) we have p(l + e)A e B = B(X ,r ). 

We have (using successively the convexity of u > t— > u^ 1+e ^ p and the subadditivity of 
v > t— > v l ^ l+tS> to get the second and third lines) 

(E\z{ n \X)\ {l+e)p ) 1/[1+e) 

V(l+e) 



< ^ 1+e )" 1 (E[exp ((1 + e)p(X, SA(T n ^X)) - n(l + e)pA^(X))] + 1 

< 2 p (l+exp 



(5.6) < 2 P 1+exp 



n 



n 



^ A(Kl + e)A) _ M(A) + 3 ^ (A ^ )( ^ 



1 + e 



where we have used (Al) and (A3). Since A is differentiable at Ao, by using the first order 
Taylor expansion of A at Ao, for each r G (0,r /2), uniformly in A G B(Xo,r) we have 



A(p(l + e)A) 
1 + e 



P A(X) = (l-p)(A(Ao)-(Ao,VA(A )))+^(e)e + 77(p J r)(p-l + r) 
= (p - 1)A*(VA(A )) + f (e)e + 7j(p, r )(p - 1 + r), 
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where £(e) is bounded over (0, eo) and r](p, r) tends to as p tends to 1 + and r tends to + . 
This yields 

(E|z{ rt) (A)|( 1+ ^) 1/(1+e) 



< 2 p (l + exp [n((p - 1)A*(VA(A )) + £(e)e + rj(p, r)(p-l + r) + 35 n {A, $)(£, 

< 2 P+1 exp [n((p - 1)A*(VA(A )) + £(e)e + rj(p, r)(p-l + r) + 35 n (A, $)(£<,)) 
the last inequality coming from the fact that A* > 0. 

Recall (A2). Let h = (p — 1)(1 + e)/e and notice that q"]„j < a^ 1 . This yields 



(5.7) 



M(n,p,e) < 2 p - l C p M t 



< OO. 



Thus, due to (5.5), for n large enough so that 35 n (A, $)(i?o) < e, uniformly in A G B(\q, r) 
we have 

fc(n) p ^ 

2( ]Tzj n) (A) J < 2 2p C p M e h /[1+e) k{n)exp [n((p-l)A*(VA(\))+£(e)e+r](p,r)(p-l+r] 

3=1 



where £(e) = £(e) + 1, hence 



(5.8) P 



k(n) 



fc(n) ^ J 



> e < e" p A;(n)- p E 



fe(n) 

E^l n) (A) 



< 2 2p C p M f h /{1+e) e~ p k(n) 1 - p exp [n((p - 1)A*(VA(A)) + f(e)e + 7?(p, r)(p - 1 + r))] . 

Let 7] > such that k(n) > exp(n(A*(VA(Ao)) + 77)) for n large enough. The previous 
inequality yields, for n large enough, uniformly in A G B(X ,r), 



P 



1 \ 

uri E Z i ( A ) > e ) ^ 2 2p L7 p M^ (1+e) e- p exp (n((l - p)r? + £(e)e + ^(p, r)(p - 1 + r))). 

ft W , = 1 



Hence, fixing p close enough to 1 and r small enough so that r](p, r)(p — 1 + r) < (p — \)r]/2, 
we get 



P 



1 fc M V 

— £Z< n) (A) >e) <2 2 ^ p Mf 1+ V p exp(n((l-p)r ? /2 + ^( e ) e )). 



Then, for every e small enough so that £(e)e < (p — 1)^/4 we have 



Ep 

n>l 



fe(n) 



A;(n) £J 



> e < 00 



for every A G -B(Ao, r) (notice that at fixed p, ft. tends to 00 as e tends to 0, this is why we 
need (A2)). Now, by the Borel-Cantelli lemma and (5.3), we can conclude that for every 
A G B(X ,r) we have lim^^ A^ n) ' w (A) - A^ n) (A) = almost surely, hence lim n _ ) . 00 A^(A) = 
A(A) almost surely. From this we deduce that, with probability 1, lim^oo A^(A) = A(A) for 
every A in a countable and dense subset of B(X ,r). Since the functions A^(A) and A are 
convex, we deduce from Theorem 10.8 in that almost surely, A(A) converges to A(A) for 
all A in B(X , r). 
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(2) The first part is a direct consequence of (1) and Theorem 1.5 



Now let Xq = VA(Ao). Let rj > such that k{n) < exp(n(A*(xo) — rf)) for n large enough. 
For e small enough, if n is large enough, we have 

P( 3 l<j< k{n) : S ^ Tb ' l)nX ) G B , t) \ < fc up( W e B(x e ~ 

< exp(— nrj/2) 



by Theorem L5j Thus, by the Borel-Cantelli lemma we see that if e is small enough, with 
probability 1, for n large enough {l < j < k(n) : s "^ Th n 1)nx ) g B(x , e)} is empty. 

(3) Let P : t > i-)> A(t\ ). Without loss of generality we assume that Ao 7^ 0. Notice 
that P is differentiable at 1 since A is differentiable at Ao, and our assumption on the 
strict convexity of A implies that A*(x ) = P*(P'(l)) > 0. Moreover, by our assumption 
on the strict convexity of A, at each point t G (0, 1) at which P is differentiable we have 
P'(t) < P'(l) and P*(P'(t)) < P*(P'(1)). Consequently, if we set P%(t) = A£(tA ), we 
deduce from Theorem |2.3[ 1) that with probability 1, for all t G (0,1), P£{t) converges 
to P(t) as n tends to 00. Now, we notice that for any s > 1, by the superadditivity of 
V > h-> y s and the definition of P%, we have for 77 G (0, 1) that P£(s(l - rj)) < sP£(l - 
rf) + (s — 1) log(k n )/n. Thus, due to our assumption on k(n), limsup^^ P^(s(l — 77)) < 
sP(l - rj) + (s — l)A*(x ). If t > 1, for each 77 G (0,1), if we set s = t/(l - rj), we 
get limsup^^P^t) < tP{l - rf)/{l - rf) + (t - 1 + ry)A*(x )/(l - 77). Consequently, 
limsup^ P%(t) < tP{\) + {t- l)A*(x ) = A(A ) + it - l)(A ,x ). 

On the other hand, by convexity, for all n > 1 and 77 G (0, 1), for t > 1 we have P^(£) > 
P n w (l -77) + (*- l + 77)(i^)'(l -77). Thus liminf^oo P£{t) > P(l~v) + {t-l + rf)P'{l-rf) 
for each 77 so that P'{l — rf) exists. Letting rj go to 1, we get lim inf^^ P£(t) > P(l) + (t — 
l)P'(l) = A(Ao) + (t — 1)(Xq,xo)- Thus we have the conclusion. 

5.2. Proof of Theorem |2.4 Since A is twice continuously differentiable, by using the 
second order Taylor expansion of A we can get for all A G Bp 12 and for all p G (1,2) and 
e > such that p(l + e)P p /2 C P p 

A(p(l + e)A) (p(l + e ) - 1)A*(VA(A)) + *(p, e) 



1+e " v 7 1+e 

where |5(p, e)| < ^{B p ){p{l + e) — l) 2 . Consequently, for all A G B p / 2 and for all p G (1, 2) 
and e G (0, 1/2) such that p(l + e)P p/2 C B p , 

A ^ 1 + e ^-M(A)-(p-l)A*(VA(A)) 



1 + e 

< A * (VA( f| £ £ + ^ e) < A*(VA(A))e + UB P )(P - if + ( P 2 e + 2 P (p - l))6(a„)e 

< 6(fl„)(p - l) 2 + (P 2 e + 2p(p - 1) + 1)(6(S P ) + 6(fl„))e 

= UB P )(P - l) 2 + (P 2 e + 2p(p - 1) + l)£(POe. 



Thus, for p close enough to 1 and e close enough to 
A ^ 1 + e)A) -,A(A)-(p-l)A*(VA(A)) 



1 + e 



<£ 2 (P P )(p-l) 2 + 2£(P P )e. 
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Let (k(n)) n >i and (e n ) n >i be as in the statement, and take e = e n and p = p n = 1 + JT^. 



Denning the variables Zj as in the proof of Theorem 2.3[ by using (|5.6|) and (|5.8|) we can 

get 



(5.9) 



F ( 7^ E Z fW > en) < S^M^^e^ exp(-nr(n, A))) 
v K \ n ) j=i / 



with h n = (p„-l)(l+e n )/e„ and r(n, A) = ((p„-l)(log(A;(n))/n-A*(VA(A))-6(B p )(p„- 
l) 2 -2f (£ p )e n -35 n A(£ p )-35 n $(£,,)). We have (recall the value of C p given in LemmaOl)) 



o(e; 



3/2- 



as n tends to oo. Moreover, 

r{n, A) > r{n) = ^(log (A; (n))/n - A*(B)) - 3(£(5 p ) en + 5 n A(5 p ) + 8 n $(B p )), 

and an estimation provided at the end of this proof shows that = 0(1) as n tends 

to oo. Thus, 



P 



1 , \ \ 

fc( ^ E > en) = 0(e„- 3 / 2 exp(-nrH)). 



Now,let#(n) = llog 2 (Vd/e n )J +1 and G n (B p/2 ) = {(k u . . . , k d ) G Z d : (M"^, • • • , ^2"^)) G 
-Bp/2}- There exists a constant C(B p / 2 ) depending on the volume of B p / 2 only such that 
#Gn{B p/2 ) < C(B p/2 )e~ d , hence 



P(3A G G n {B p/2 ) : 



fc(n) 



> e r 



Ofo (8/a+<0 «p( 



-nr{n 



))). 



and due to (2.7), the Borel-Cantelli lemma ensures that, with probability 1, for n large 
enough, for all A G G n (B p / 2 ), Y%=i %\ W — e n- This can be used in (5.3) and 



combined with (5.2) to get for n large enough 

(5.10) sup |A£(A n ) - A(A n )| < (e n + e 2 n )/n + 5 n A(B p ) + 5 n <S>(B p ). 

Since {G n {B p / 2 )) n >i is increasing and (Jn>i G n {B p / 2 ) is dense in -B p /2, the convexity of A^ 
and A ensures that converges uniformly to A over B p / 2 , and VA^ converges uniformly 
to VA over B (see [SB], Th. 10.8 and 25.7). Thus, for any rj > 0, if n is large enough, 
we have both (5.10) and sup AgB ||VA^ — VA|| < n/2, so that for all A G B, we can choose 
A n G G n {B) such that ||A — A n || < e n , hence 

K(A)-A(A)| < K(A„)-A(A n )| + |A-(A)-A-(A n )| + |A(A)-A(A n )| 

< (e n + e 2 )/n + 5„A(E P ) + 5„$(5 P ) + (rj/2 + 2 max ||VA||)||A - A n || 



< {n + 2 max ||VA||)e„ + 5„A(5 P ) + 
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It remains to prove that M^J { ' = 0(1) as n tends to oo. Due to (A2'), there exists 
C > such that a I (u)" 1 < C\\og(u)\ 1/e for all u G (0, 1]. This yields for h > 

M h = l\a x {u)- l ) h du< f 1 C h \\og(u)\ h / du 
Jo Jo 

= C h T(l + h/6) = o(c ft (iV(M)/e) W) M(M)), 

where N(h,6) = \ h/6\ +1 and we have use Stirling's formula. 

Now, we can use the fact that h n = (1 + e n ) / ^Je^ and the estimate above to conclude that 
M e„/(i+e„) = Q ^ ag n tendg tQ ^ 



5.3. Proof of Theorem 2.5 Here we have S n A = <5 n $ = 0, so that with respect to the 



proof of Theorem 2_A, we can consider the centered, independent and identically distributed 
variables 



Z nij (X) = exp ((A, S„$(T^X)> - nA(A)) - 1, 



instead of the z\ n) (X), with A(A) = log E (exp (A, X}). Now we can use Lemma 



5.1 



of Lemma 5.1[ 1). This yields, for p small enough so that pB C B p and A G B 



'2) instead 



E 



k(n) 



i=i 



E^i(A) j < 2^(n)(E|Z n)1 (A)| f 



< 2 2p - 1 A;(n)(l + EexpQo(A, S n $(T nj X)} - npA(X))) 
= 2 2p - 1 k{n)(l + exp (n{A{p\) -pA(A)))) 

< 2 2 ^Hexp(n[(p- 1)A*(VA(A)) + (p - l) 2 6(5p)]). 



The proof finishes as that of Theorem 12. 4[1 



5.4. Proof of Theorem 2.1 (1) Fix 6> > 0. For n > 1, we denote by juW the probability 
distribution of S n $ n (X) /n and by the empirical distribution of (S n Q n (J?^~ l >X{u))/n. 

Recall that (5 n )n>i is defined in (A3). Let B n G {£>(£, r — 5 n ),B(x,r + S n )}. We can 
estimate P( n^' u (B n )-^(B n ) > 9^{B n )) as when we get thanks to CorollaryloTl 
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Fix p G (1, 2), e > and h = (p - 1)(1 + e)/e. We have 
P(|^(S n )-#(S ri )|>^)(S n )) 
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fc(n) 



£ l Bn {SA{T^ n X)/n) - P(5„$ n (X)/n G fl„ 

3=1 



l Bn (S n $ n (X)/n) - F(S n $ n {X)/n G B r , 



V(i+6) 



-fc(n)P 

< 2 p - 1 L7 p M^ /{1+e) (^i n) (^n))^A;(n) 1 - p E 

< 2V- 1 C p Mf l+e \e^\B n ))-vk(n) 1 -v 

.(2^-\F(SA(X)/n G £„) +F(S n $ n (X)/n G 5 n ) p(1+e) 

< 2 2p - 1 C p M* /(1+ ^%) 1 -^^ 

< 2 2p C p M;/ (1+e) 6- p k(n) l - p ^\B n )- p+1/{1+e) 

< 2 2p C p M t h /(1+e) e- p k(n) 1 - p i2 n (B(x, r - 25 n ))- p+1/{1+e \ 

Now suppose that liminf > I(x) and let 77 > such that k(n) > exp(n(I(x)+r])) for 

n large enough. Since (/i n )n>i satisfies the LDP with rate function J, for n large enough we 
have fi n (B(x, r — 25 n )) > exp(— n(I(x) + tj/4)). Consequently, we can choose e small enough 
so that k(nY~ p jj, n (B(x,r — 25 n ))^ p+1 ^ 1+ ^ < exp(— n(j> — 1)t}/2) for n large enough. Then, 
the previous bound for T>(\^(B n ) - ^{B n )\ > 9^{B n )) yields En>i ¥^>{B n ) - 
fj,^(B n ) > 9fx^(B n )j < oo for any 9 > 0. Hence, by the Borel-Cantelli lemma we get that 

with probability one, lim n _ 



p(l+e)\ l/(l+e) 



i log = for B n G {B(x, r - 6 n ), B(x, r + 5 n )}. 



Moreover, we have fi^' ul (B(x,r — 5 n )) < /j%(B(x,r)) < n^ ,u) (B(x,r + 6 n )), and on the 
other hand we have jj n (B(x,r — 25 n )) < fi^'(B(x,r — 5 n )) < [i n (B(x,r)) < fi ( ^ l \B(x,r + 
5 n ))<^n(B(x,r + 25 n )). 

Consequently, for any r > 0, with probability 1, 

liminf log u n (B(x, r + 25 n )) < liminf log u%(B(x, r)) 

n—too Tl n— >oo yi 

< limsup log fi^(B(x, r)) < limsup log fi n (B(x, r — 25 n )). 

n— >oo Tl ri— >oo Ti 

This implies that with probability 1, for all r G we have 

1 1 
liminf log u n (B(x, 3r/2j) < liminf log u£(B(x,r)) 

n— >oo fi n— >oo yi 

< limsup log fi^(B(x, r)) < limsup log fi n (B(x, r/2)). 

n— >oo Tl n— >oo TL 

But since (jU n ) n >i satisfies the LDP with rate function /, we have for all y G y (see [12J, 
Th. 4.1.18) 

(5.11) lim —liminf — log u, n (B(y, s)) — lim — Hmsup — logu n (B(y, s)) — I(y). 

s^0+ n->oo n s->0+ n->oo Tl 

This, together with the previous inequalities yields the desired result. 
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At last, suppose that lim sup < I(x). An estimate similar to that used to establish 

n— >o o Tl 

the second part of Theorem 2.3[ 2) yields the desired result. 
(2) Let B e By and 7 > 0. If /!„,(£) > we have 

k(n) 

F(j%(B) > exp( ni )fi n (B)) < P( E l B (S n ^(T^ n X)/n) > k(n) exp(n 7 ) /!„,(£?) 

3=1 

fc(n) 

< A;H- 1 exp(-n 7 ) yUri ( J B)- 1 E £ l B (^ n $(T^- 1 ) n X)/n) 



3=1 



exp(— 727), 



and clearly if fi n (B) = then i-i%(B) = almost surely so that we also have F(fi^(B) > 
exp(nj)fi n (B)) < exp(-ny). 

Now fix r > and take B = B(x,r). Since J2n>i ex P(~ n l) < °°) the Borel-Cantelli 
lemma yields lim sup^^ Mog/i^(£>(x, r)) < 7+limsup 7l _ >00 - log/i n (S(x, r)) almost surely. 
This holds for all 7 > 0, so limsup^^ Mog/x^(5(rr, r)) < limsup^^ ^ \ogn n {B(x, r)) 
almost surely. This is enough to conclude thanks to (5.11) and the fact that I(x) = 00. 

(3) Let a < 00 and K a C^, a compact set such that limsup^oo - log ^ n {K^) < —2a. By 
using the estimate obtained above with B = K c a and 7 = a we get that with probability 1, 
limsup,^ ± log < -a. 



5.5. Proof of Theorem |2.2[ Our goal is to prove that, with probability 1, for all y G y 

we have 

(5.12) lim liminf log/j%(B(y, r)) = lim limsup log^(B(y, r)) = I{y). 

r->0+ n^oo n r-5-0+ n->oo 77 

Then, due to Theorem 4.1.11 in |12j . we have the desired almost sure weak LDP. 

Let X> be a dense countable subset of XV- We can deduce from the end of the proof of 
Theorem |2.1[ 1) that there exists a measurable subset Cl' of f2 such that P(fi') = 1 and for 
all u G Q', for all x G X> and for all r G we have 

liminf log// n (5(x, 3r/2)) < liminf log fj%(B(x,r)) 

< limsup log fj%(B(x, r)) < limsup log fj, n (B(x, r/2)). 

n— >oo 71 n— >oo 72 

Now let y G XV. For all s > we can find x G X> as well as a rational number < r < s 
such that B(y,s/A) C B(x,r/2) C B(y,s) C B(x,3/2r) C B(y,2s). Consequently, for all 
w 6 O', ?/ G P; and r > we have 

liminf-- log n n (B(y, 2s)) < liminf -- log f£(B(y, s)) 

1 1 

< limsup log fj%(B(y, s)) < limsup log fx n (B (y , s / A)) . 

Tl— >O0 Tl Tl— >oo Tl 
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Due to (5.11 ), for all u E fi,' and y E Vj we get 



lim liminf logu^iBCy, s)) 



lim lim sup 

s— >0+ n— >oo 



It 



log^ n (B(y,s))=I(y), 



that is Q5.12D for y G £>/. 

Now suppose that y \ Di ^ and let P' be a dense subset of y \T>j. Due to the 
facts established in the proof of Theorem 2.1[ 2), there exists a measurable subset Q' of 
Q such that P(O') = 1 and for all w G Q', for all x E V, for all r G we have 
limsup^^log/z^S^r)) < limsup^^ ^ log fi n {B{x,r)). 

Now, for all y E y \T>i and s > 0, we can find x E T>' and < s < r E Q such 
that B(y,s) C B(x,r) C B(y,2s), and the previous inequality yields, for all u' E 
limsup^^log/^S^s)) < limsup^^log/^-B^s)). This yields ( |5.12| ). 

6. Proofs of Theorem [371] and 13.21 



3.1, Each interval Jk(n),j can be decomposed into a union of n 



s Jk( n ),j,i °f length l/nk(n). The increments AVK( Jk(n),j,i) take 



6.1. Proof of Theorem 

consecutive closed interva 

the form (nk(n))~ 1 ^ 2 Xk( n ),j,i, where (Xk( n ),j,i)i<j<k(n) is a family of nk{n) centered Gauss- 

l<i<re(n) 

ian vectors of covariance matrix the identity. Thus &W(Ju n ),j) = (nk^n))^ 1 ^ 2 S n (j) with 
Sn(j) = T,i=iXk( n ),j,i- Let 

Z nJ (X) = exp ((A, S n (j)) - nA(X)) - 1, 



withA(A) = logE(exp(A,X fc(n ) J - i )) = ||A|| 2 /2, hence A*(VA(A)) = || A|| 2 /2 and *AD 2 A(A)A 
|| A || 2 for all A G M d . As in the proof of Theorem 2.5 we have 



E 



fe(n) 



£Z nj (A) J <2 2p - 1 A;H(l + exp(n(A(pA)-pA(A)) 
j'=i 

which, due to the special form of A, yields 

k(n) 



E 



( £ Z njj (X) P ) < 2 2 ?k(n) exp (n[(p - l)||A|| 2 /2 + (p - l) 2 ||A|| 2 /2 



Then, we can use the same approach as that used in the proof of Theorem 2A to get that 
under (3.1), with probability 1, 

1 



& A n(A) = -l0g 



1 



n 



k{n) Eexp((A,5 n (j)))) = A(A) = ||A|| 2 /2 



for a dense and countable subset of points A G B, hence for all A G B by convexity of the 
functions A£\ This is enough to get the result. 



6.2. Proof of Theorem 3.2 We let the reader adapt the lines of the proof of Theo- 
rem |2.2| to the present situation. The only change is that here for each n > 1 one must 



consider the i.i.d sequence of Brownian motions obtained by juxtaposition of the k(n) se- 
quences of n Brownian motions ((Wfc( n ),j,i)te[o,i]) 1< - < > 1 < j < k(n), where Wk( n ),j,i(t) = 
{nk{n)Y/\W{^ + - W(j^ + ^)), so thatV^/n^ = Sn {j)/n with S n (j) = 

J27=l W k(n),j,i- 
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